AITopics | ground truth box

Collaborating Authors

ground truth box

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion

Neural Information Processing SystemsApr-29-2026, 23:20:45 GMT

Automated creation of synthetic traffic scenarios is a key part of validating the safety of autonomous vehicles (AVs). In this paper, we propose Scenario Diffusion, a novel diffusion-based architecture for generating traffic scenarios that enables controllable scenario generation. We combine latent diffusion, object detection and trajectory regression to generate distributions of synthetic agent poses, orientations and trajectories simultaneously. To provide additional control over the generated scenario, this distribution is conditioned on a map and sets of tokens describing the desired scenario. We show that our approach has sufficient expressive capacity to model diverse traffic patterns and generalizes to different geographical regions.

artificial intelligence, machine learning, scenario, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report (0.49)

Industry: Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.68)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.67)

Add feedback

Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion

Neural Information Processing SystemsFeb-17-2026, 10:29:51 GMT

Work done during an internship at Zoox 37th Conference on Neural Information Processing Systems (NeurIPS 2023).

artificial intelligence, machine learning, scenario, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Research Report (0.49)

Industry: Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Robots (0.94)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.68)

Add feedback

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds

Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni

Neural Information Processing SystemsFeb-14-2026, 06:42:05 GMT

Neural Information Processing Systems http://nips.cc/

ground truth box, point cloud, segmentation, (11 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Target Detection of Safety Protective Gear Using the Improved YOLOv5

Liu, Hao, Qin, Xue

arXiv.org Artificial IntelligenceAug-12-2024

In high-risk railway construction, personal protective equipment monitoring is critical but challenging due to small and frequently obstructed targets. We propose YOLO-EA, an innovative model that enhances safety measure detection by integrating ECA into its backbone's convolutional layers, improving discernment of minuscule objects like hardhats. YOLO-EA further refines target recognition under occlusion by replacing GIoU with EIoU loss. YOLO-EA's effectiveness was empirically substantiated using a dataset derived from real-world railway construction site surveillance footage. It outperforms YOLOv5, achieving 98.9% precision and 94.7% recall, up 2.5% and 0.5% respectively, while maintaining real-time performance at 70.774 fps. This highly efficient and precise YOLO-EA holds great promise for practical application in intricate construction scenarios, enforcing stringent safety compliance during complex railway construction projects.

detection, loss function, protective equipment, (13 more...)

arXiv.org Artificial Intelligence

2408.05964

Country:

Europe > United Kingdom (0.14)
Europe > Switzerland (0.04)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Heidelberg (0.04)
(5 more...)

Genre: Research Report (0.84)

Industry:

Construction & Engineering (1.00)
Transportation > Ground > Rail (0.92)
Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Architecture (1.00)
Information Technology > Security & Privacy (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)

Add feedback

Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion

Pronovost, Ethan, Ganesina, Meghana Reddy, Hendy, Noureldin, Wang, Zeyu, Morales, Andres, Wang, Kai, Roy, Nicholas

arXiv.org Artificial IntelligenceNov-16-2023

agent, scenario, trajectory, (15 more...)

arXiv.org Artificial Intelligence

2311.02738

Country:

North America > United States > California > San Francisco County > San Francisco (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Research Report (0.85)

Industry: Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.68)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.67)

Add feedback

Unbalanced Optimal Transport: A Unified Framework for Object Detection

De Plaen, Henri, De Plaen, Pierre-François, Suykens, Johan A. K., Proesmans, Marc, Tuytelaars, Tinne, Van Gool, Luc

arXiv.org Artificial IntelligenceJul-5-2023

During training, supervised object detection tries to correctly match the predicted bounding boxes and associated classification scores to the ground truth. This is essential to determine which predictions are to be pushed towards which solutions, or to be discarded. Popular matching strategies include matching to the closest ground truth box (mostly used in combination with anchors), or matching via the Hungarian algorithm (mostly used in anchor-free methods). Each of these strategies comes with its own properties, underlying losses, and heuristics. We show how Unbalanced Optimal Transport unifies these different approaches and opens a whole continuum of methods in between. This allows for a finer selection of the desired properties. Experimentally, we show that training an object detection model with Unbalanced Optimal Transport is able to reach the state-of-the-art both in terms of Average Precision and Average Recall as well as to provide a faster initial convergence. The approach is well suited for GPU implementation, which proves to be an advantage for large-scale models.

artificial intelligence, ground truth, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2307.02402

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Generating Driving Scenes with Diffusion

Pronovost, Ethan, Wang, Kai, Roy, Nick

arXiv.org Artificial IntelligenceMay-29-2023

In this paper we describe a learned method of traffic scene generation designed to simulate the output of the perception system of a self-driving car. In our "Scene Diffusion" system, inspired by latent diffusion, we use a novel combination of diffusion and object detection to directly create realistic and physically plausible arrangements of discrete bounding boxes for agents. We show that our scene generation model is able to adapt to different regions in the US, producing scenarios that capture the intricacies of each region.

agent, arxiv, detection, (15 more...)

arXiv.org Artificial Intelligence

2305.18452

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Massachusetts (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)

Genre: Research Report (0.54)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.87)

Add feedback

Confident Object Detection via Conformal Prediction and Conformal Risk Control: an Application to Railway Signaling

Andéol, Léo, Fel, Thomas, De Grancey, Florence, Mossina, Luca

arXiv.org Artificial IntelligenceApr-17-2023

Deploying deep learning models in real-world certified systems requires the ability to provide confidence estimates that accurately reflect their uncertainty. In this paper, we demonstrate the use of the conformal prediction framework to construct reliable and trustworthy predictors for detecting railway signals. Our approach is based on a novel dataset that includes images taken from the perspective of a train operator and state-of-the-art object detectors. We test several conformal approaches and introduce a new method based on conformal risk control. Our findings demonstrate the potential of the conformal prediction framework to evaluate model performance and provide practical guidance for achieving formally guaranteed uncertainty bounds.

artificial intelligence, machine learning, prediction, (16 more...)

arXiv.org Artificial Intelligence

2304.06052

Country:

Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.86)

Industry: Transportation > Ground > Rail (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

Add feedback

CornerNet : Detecting Objects as Paired Keypoints

#artificialintelligenceJun-25-2022, 13:36:02 GMT

CornerNet is a different object detection technique where we detects the objects bounding box by a paired key-points, the top-left corner and the bottom-right corner using a single convolution neural network. By detecting the key points, it eliminates the need of different anchor boxes commonly used in single stage detectors. In this paper by Hei Law and Jia Deng from Princeton University, they have introduced a new approach to object detection which outperforms all the single stage detectors. CornetNet introduces a new type of pooling layer called Corner Pooling, that helps localizing the corners. The Net achieves 42.2% AP on MS COCO dataset.

anchor box, module, prediction module, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.56)

Add feedback

Everything You Need to Know About Object Detection Systems

#artificialintelligenceMar-27-2021, 19:30:11 GMT

With the advent of deep learning, implementing an object detection system has become fairly trivial. There are a great many frameworks facilitating the process, and as I showed in a previous post, it's quite easy to create a fast object detection model with YOLOv5. However, understanding the basics of object detection is still quite difficult. It involves a lot of math, and the variable number of outputs/bounding boxes makes it harder to understand than image classification, where we know the number of outputs beforehand. With so many moving parts and new concepts introduced over the history of object detection, it certainly hasn't gotten easier. In this post, I'll distill all this history into a simple guide that explains all the details of object detection and instance segmentation systems. The classic image classification problem is very well known: given an image, can you find the class the image belongs to? We can solve any new image classification problem with ConvNets and transfer learning using pre-trained nets where Convnets are fixed feature extractors.

detection, grid cell, region proposal, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback